Multivariate regression analysis of distance matrices for testing associations between gene expression patterns and related variables.
نویسندگان
چکیده
A fundamental step in the analysis of gene expression and other high-dimensional genomic data is the calculation of the similarity or distance between pairs of individual samples in a study. If one has collected N total samples and assayed the expression level of G genes on those samples, then an N x N similarity matrix can be formed that reflects the correlation or similarity of the samples with respect to the expression values over the G genes. This matrix can then be examined for patterns via standard data reduction and cluster analysis techniques. We consider an alternative to conventional data reduction and cluster analyses of similarity matrices that is rooted in traditional linear models. This analysis method allows predictor variables collected on the samples to be related to variation in the pairwise similarity/distance values reflected in the matrix. The proposed multivariate method avoids the need for reducing the dimensions of a similarity matrix, can be used to assess relationships between the genes used to construct the matrix and additional information collected on the samples under study, and can be used to analyze individual genes or groups of genes identified in different ways. The technique can be used with any high-dimensional assay or data type and is ideally suited for testing subsets of genes defined by their participation in a biochemical pathway or other a priori grouping. We showcase the methodology using three published gene expression data sets.
منابع مشابه
Assessing Experimental and Intelligent Models in Estimating Reference Evapotranspiration
Introduction: As the most important element in the hydrologic cycle which depends on climate variables such as near-ground wind speed, air temperature, solar radiation, and relative humidity, reference evapotranspiration (ET0) is normally computed through a variety of methods, each of which requires different and in some cases extensive data that are unavailable in many circumstances, especial...
متن کاملInformation and Covariance Matrices for Multivariate Pareto (IV), Burr, and Related Distributions
Main result of this paper is to derive the exact analytical expressions of information and covariance matrix for multivariate Pareto, Burr and related distributions. These distributions arise as tractable parametric models in reliability, actuarial science, economics, finance and telecommunications. We showed that all the calculations can be obtained from one main moment multidimensional integr...
متن کاملA method for assessing phylogenetic least squares models for shape and other high-dimensional multivariate data.
Studies of evolutionary correlations commonly use phylogenetic regression (i.e., independent contrasts and phylogenetic generalized least squares) to assess trait covariation in a phylogenetic context. However, while this approach is appropriate for evaluating trends in one or a few traits, it is incapable of assessing patterns in highly multivariate data, as the large number of variables relat...
متن کاملMixed Lineage Kinase Domain-Like Pseudokinase (MLKL) Gene Expression in Human Atherosclerosis with and without Type 2 Diabetes Mellitus
Background: MLKL, one of the main downstream components of the necroptosis or programmed necrosis has recently been demonstrated in advanced atherosclerotic lesions. However, its precise role in the atherosclerosis pathogenesis still requires more elucidation. Our study was set to delineate both the changes in peripheral MLKL gene expression and its influence on disease severity in atherosclero...
متن کاملDetermining Effective Factors on Forest Fire Using the Compound of Multivariate Adaptive Regression Spline and Genetic Algorithm, a Case Study: Golestan, Iran
Determining Effective Factors on Forest Fire Using the Compound of Multivariate Adaptive Regression Spline and Genetic Algorithm, a Case Study: Golestan, Iran Pahlavani, P., Assistant professor at School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran Raei, A., PhD Candidate of GIS at School of Surveying and Geospatial Engineering, College of Engineeri...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Proceedings of the National Academy of Sciences of the United States of America
دوره 103 51 شماره
صفحات -
تاریخ انتشار 2006